Topic Modeling Using Collapsed Typed Dependency Relations
Authors
Abstract
Topic modeling is a powerful tool for uncovering the hidden thematic structure of documents. Many conventional topic models represent documents as bags of words and thereby neglect important linguistic structure. In this paper, we propose a novel topic model that enriches text documents with collapsed typed dependency relations to capture syntactic and semantic dependencies between both consecutive and nonconsecutive words. In addition, we propose enforcing coherent topic assignments for conceptually similar words by generalizing words with their synonyms. Our experimental studies show that the proposed model and strategy outperform the original LDA model and the Bigram Topic Model in terms of perplexity, and that their performance is comparable to other models in terms of stability, coherence, and accuracy.
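As a rough illustration of the enrichment idea in the abstract, the sketch below adds collapsed typed dependency relations to a bag-of-words representation and generalizes words through a synonym map. It is a minimal sketch under stated assumptions, not the authors' implementation: the dependency triples are written by hand rather than produced by a parser, and the names `SYNONYMS`, `generalize`, and `enrich` are hypothetical.

```python
from collections import Counter

# Hypothetical synonym map (assumption): the abstract does not say which
# synonym resource the authors use.
SYNONYMS = {"automobile": "car"}


def generalize(word):
    """Lowercase a word and replace it with its canonical synonym, if any."""
    w = word.lower()
    return SYNONYMS.get(w, w)


def enrich(tokens, dependencies):
    """Build a bag of features from words plus dependency relations.

    tokens: list of surface words.
    dependencies: list of (relation, head, dependent) triples, assumed to be
    already collapsed (e.g. a preposition folded into a 'prep_near' relation).
    """
    features = [generalize(t) for t in tokens]
    for relation, head, dependent in dependencies:
        features.append(f"{relation}({generalize(head)},{generalize(dependent)})")
    return Counter(features)


# Hand-written example parse (assumption), in collapsed style.
tokens = ["The", "automobile", "stopped", "near", "the", "house"]
dependencies = [
    ("det", "automobile", "The"),
    ("nsubj", "stopped", "automobile"),
    ("prep_near", "stopped", "house"),  # links two nonconsecutive words
    ("det", "house", "the"),
]

bag = enrich(tokens, dependencies)
```

The resulting `bag` could then be fed to any count-based topic model in place of plain word counts; how the paper actually integrates the relations into its model is not specified in the abstract.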
Similar resources
Generating Typed Dependency Parses from Phrase Structure Parses
This paper describes a system for extracting typed dependency parses of English sentences from phrase structure parses. In order to capture inherent relations occurring in corpus texts that can be critical in real-world applications, many NP relations are included in the set of grammatical relations used. We provide a comparison of our system with Minipar and the Link parser. The typed dependen...
Typed Dependency Relations for Syntactic Analysis of Thai Sentences
This paper describes a preliminary effort in identifying many different types of relations among words in Thai sentences based on dependency grammar. The relation is represented as a triple containing the pair of words and their relation. So far, the current representation contains 35 grammatical relations. The dependencies are all binary relations. That is, a grammatical relation holds between...
Detecting Opinion Sentences Specific to Product Features in Customer Reviews using Typed Dependency Relations
Customer reviews contain opinions of the customers who purchased products and expressed opinions concerning their satisfactions and criticisms. Due to vast availability of product reviews in the web, it is extremely time-consuming and at times confusing for a new customer to manually analyze the reviews prior to buying a product. Reviews generally involve the presence of product feature specifi...
Extracting Noun Phrases in Subject and Object Roles for Exploring Text Semantics
In tune with the recent developments in the automatic retrieval of text semantics, this paper is an attempt to extract one of the most fundamental semantic units from natural language text. The context is intuitively extracted from typed dependency structures basically depicting dependency relations instead of Part-Of-Speech tagged representation of the text. The dependency relations imply deep...
Deciding Entailment and Contradiction with Stochastic and Edit Distance-based Alignment
Analysis stage. Our goal at this stage is to compute linguistic representations of the passage and the hypothesis that contain as much information as possible about their semantic content. We use typed dependency graphs generated by the Stanford parser (Klein and Manning, 2003; de Marneffe et al., 2006), which contain a node for each word and labeled edges representing the grammatical relations...
Journal title:
Volume, issue:
Pages: -
Publication date: 2014